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1.0  Abstract 

Interconnect  scaling  to  deep  submicron  processes  pre¬ 
sents  many  challenges  to  today9 s  CAD  flows.  A  recent  analy¬ 
sis  by  Sylvester  and  Keutzer  examined  the  behavior  of 
average  length  wires  under  scaling ,  and  controversially  con¬ 
cluded  that  current  CAD  tools  are  adequate  for  future  mod¬ 
ule-level  designs .  In  our  work ,  we  show  that  average  length 
wire  scaling  is  sensitive  to  the  technology  assumptions, 
although  the  change  in  their  behavior  is  small  under  all  rea¬ 
sonable  scaling  assumptions .  However,  examining  only  aver¬ 
age  length  wires  is  optimistic,  since  long  wires  are  the  ones 
that  primarily  cause  CAD  tool  exceptions.  In  a  module  of 
fixed  complexity,  under  both  optimistic  and  pessimistic  scal¬ 
ing  assumptions,  the  number  of  long  wires  will  increase 
slowly  with  scaling.  More  importantly,  as  the  overall  die 
capacity  grows  exponentially,  the  number  of  modules  and 
thus  the  total  number  of  wires  in  a  design,  will  also  increase 
exponentially.  Thus,  if  the  design  team  size  and  per-de signer 
workload  is  to  remain  relatively  constant,  future  CAD  tools 
will  need  to  handle  long  wires  much  better  than  current  tools 
to  reduce  the  percentage  of  wires  that  require  designer  inter¬ 
vention. 

2.0  Introduction 

With  process  technologies  capable  of  fabricating  a  billion 
transistor  chip  on  the  horizon,  the  CAD  community  faces 
many  new  challenges.  Notably,  interconnect  scaling  in  deep 
submicron  processes  may  force  a  fundamental  change  in  cur¬ 
rent  ASIC  design  methodologies.  If  interconnect  delay 
becomes  a  significant  fraction  of  total  delay,  timing  conver¬ 
gence  for  standard-cell  design  blocks  will  become  difficult  or 
impossible  to  achieve  with  current,  crude,  fanout-based  wire 
load  models.  Designers  will  need  new  tools  and  methods  to 
synthesize  large  blocks  in  deep  submicron  technologies. 

In  a  1998  ICC  AD  tutorial,  Sylvester  and  Keutzer  carried 
out  a  detailed  analysis  of  interconnect  scaling  and  its  potential 
effects  on  CAD  methodologies  [1][2],  By  examining  the  scal¬ 
ing  of  average  length  wires,  they  concluded  that  CAD  tools  are 
adequate  for  future  module-level  designs.  We  examine  the  sen¬ 
sitivity  of  their  analysis  to  a  range  of  possible  technology  scal¬ 
ing  assumptions  by  modeling  their  simulations  with  an  RC 
tree  delay  model. 

Since  design  speeds  and  timing  convergence  in  synthesis 
flows  are  typically  constrained  by  long  wires,  not  average- 
length  ones,  we  extend  the  analysis  to  long  wires.  We  show 
that  for  a  fixed  complexity  design,  the  number  of  long  wires 
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grows  slowly  with  scaling.  If  chip  complexity  remained  con¬ 
stant,  this  increase  in  long  wires  could  be  handled  by  small 
improvements  in  today’s  CAD  design  flow.  Unfortunately 
exponentially  increasing  die  capacity  exacerbates  the  increas¬ 
ing  number  of  long  wires  per  module  by  driving  up  the  num¬ 
ber  of  modules.  Thus,  with  constant  design  team  size,  the 
number  of  gates  per  designer  will  grow  exponentially.  To  pre¬ 
vent  the  per-designer  workload  from  also  growing  exponen¬ 
tially  the  percentage  of  wires  that  need  manual  intervention 
must  fall  exponentially.  This  implies  that  future  tools  must 
handle  a  greater  percentage  of  long  wires  without  designer 
intervention.  This  is  the  key  challenge  that  the  CAD  commu¬ 
nity  must  face. 

3.0  Underlying  problem  in  synthesis 

Today’s  CAD  methodology  does  not  guarantee  conver¬ 
gence  to  a  final  solution  that  meets  timing  constraints  due  to 
the  discrepancy  between  wire-load  estimates  used  by  synthesis 
during  logic  optimizations  and  the  actual  wire-loads  after  lay¬ 
out.  The  initial  synthesis  step  in  this  methodology  decides  the 
overall  logic  structure  of  the  netlist  using  fanout-based  wire¬ 
load  models  supplied  by  the  standard-cell  library  vendor.  This 
wire-load  model  is  derived  from  statistical  analyses  of  past 
designs  in  the  library  and  represents  the  median  of  the  wire¬ 
load  distribution  for  each  fanout.  However,  the  post-layout 
wire  capacitance  has  a  Poisson  distribution  with  a  narrow  peak 
around  the  statistical  wire-load  length  and  a  long  tail  to  the 
right.  Figure  1  shows  the  discrepancy  between  post-layout  and 
statistical  wire-loads  for  a  small  design,  where  the  nets  are 
sorted  by  fanout  and  post-layout  capacitance. 


Fig.  1.  Estimated  and  actual  wire  loads  ([3]) 

Thus,  even  though  synthesis  estimates  the  wire-load  of 
short  and  medium-length  nets  with  reasonable  accuracy,  it 
highly  underestimates  the  longer  nets.  For  these  long  nets, 
synthesis  would  pick  the  wrong  logic  structure  to  drive  such  a 
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net  and  would  not  account  for  the  intrinsic  wire  delay.  These 
nets  cause  the  appearance  of  new  critical  paths  to  be  fixed  by 
incremental  optimizations  using  back-annotated  wire-loads, 
but  currently  these  optimizations  are  limited  to  gate  sizing, 
buffer  insertion,  and  critical  path  re-synthesis.  Any  changes 
need  to  be  merged  into  the  existing  layout  without  perturbing 
the  unchanged  logic.  If  the  layout  is  changed  too  much,  the 
back-annotation  used  by  incremental  synthesis  optimizations 
will  be  significantly  different  from  actual  wire-loads,  possibly 
resulting  in  a  new  set  of  critical  paths.  Finally,  long  wires  have 
intrinsic  delay  (from  their  self-resistance)  so  it  might  not  be 
possible  to  meet  critical  path  timing  using  this  placement. 
Hence  synthesis  cannot  guarantee  that  incremental  optimiza¬ 
tions  will  be  able  to  meet  timing  constraints. 

Arguably,  CAD  methodologies  need  only  enable  the  com¬ 
mon  case  of  average  wires,  and  the  designer  will  manually 
handle  the  exceptions.  This  CAD  model  is  tenable  (and  mar¬ 
ketable)  only  if  the  number  of  exceptions  remains  at  a  reason¬ 
able  level.  To  gain  further  insight  into  the  future  need  for 
module-level  CAD  tool  improvements,  Sylvester  and  Keutzer 
examined  the  behavior  of  average  length  wires  under  scaling. 
In  the  next  section,  we  review  and  extend  their  analysis. 

4.0  Average  length  wires 

Sylvester  and  Keutzer  were  motivated  by  popular  forecasts 
claiming  that  as  technologies  scale,  interconnect  delays  will 
increase  and  eventually  dominate  module-level  timing  [4]  [5]. 
At  what  size  design,  they  asked,  would  traditional  design  flows 
using  crude  interconnect  models  be  unable  to  converge  on  a 
design  that  would  meet  timing  constraints? 

To  answer  this  question,  they  constructed  strawman  tech¬ 
nologies  for  generations  from  L=0.25pm  to  L=0.05pm  that 
were  generally  more  conservative  than  those  described  in  the 
SIA  roadmap  [4].  Using  these  technologies,  they  simulated  a 
ring  oscillator  with  and  without  average  length  wires  between 
the  gates  to  get  intrinsic  gate  delay  and  interconnect  delay. 
When  adding  wires,  the  gates  were  upsized  until  the  incremen¬ 
tal  upsizing  speed  benefit  was  under  2%.  Since  their  simula¬ 
tions  showed  that  under  scaling,  interconnect  delay  fell  relative 
to  total  delay,  they  concluded  that  with  proper  upsizing,  inter¬ 
connect  scaling  will  not  prevent  current  CAD  flows  from 
working  for  future  50K  gate  designs. 

4.1  Reformulation  of  the  analysis 

Here  we  reformulate  their  simulation  study  (sections  7.1.2 
and  7.1.3  of  [1])  in  an  equivalent  analytical  approach.  We 
model  transistors  by  an  effective  resistance  and  measure  delay 
by  calculating  the  RC  time  constants  [6].  Consider  a  fanout=2 
ring  oscillator  of  inverting  gates.  Without  wires  between  the 
stages,  the  “intrinsic  gate  delay”  of  each  stage  is  independent 
of  device  size  and  is  shown  in  EQ  1 .  Next  we  add  an  average- 
length  wire  between  stages  and  upsize  the  gates.  The  iterative 
upsizing  algorithm  used  in  [1]  arrived  at  the  same  device  sizes 
as  well-known  circuit  design  techniques  that  aim  for  effective 
fanouts  to  be  around  4  for  optimal  performance  [7].  Since  the 


line  resistance  of  average  length  wires  (230pm  long  in  a 
0.25pm  technology)  is  much  less  than  the  driver  resistance,  we 
will  follow  Sylvester  and  Keutzer’s  lead  and  ignore  the  wire 
resistance.  The  delay  with  wires  is  shown  in  EQ  21. 

^  nowire  =  RdMCgale  +  C di/f)  (EQ  1  ) 

Dwithwire  =  R+ACwtrr  +  2 Cxote  +  Cdiff )  (EQ  2) 

Since  the  Dnowire  (intrinsic  gate  delay)  term  is  not  depen¬ 
dent  on  device  size,  we  assume  that  the  devices  are  sized  the 
same  in  both  cases,  so  that  the  Rdrv  terms  are  equal.  Then  the 
ratio  of  interconnect  delay  to  total  delay  can  be  written  as 


Ratio  - 


Cwir 


Cwire+  2Cgate  +  Cdiff 


(EQ  3) 


As  we  scale  to  future  processes,  we  can  apply  their  scaling 
heuristics  [2],  where  S  is  the  process  scaling  factor  (S=0.7  per 
generation). 

Using  these  scaling  factors,  the  numerator  of  the  delay  ratio 
decreases  faster  than  the  denominator,  and  thus,  the  ratio  of 
wire  delay  to  total  delay  decreases.  If  we  substitute  appropriate 
numbers  for  the  capacitances,  we  get  ratios  close  to  their  simu¬ 
lation  results  (39%  and  26%  from  simulation  versus  38%  and 
25%  from  the  model,  for  0.25pm  and  0.10pm  technologies 
respectively).  Since  this  RC  model  matches  the  simulation- 
based  analysis  of  [1],  we  can  use  it  to  examine  the  sensitivity 
of  the  analysis  to  variations  in  scaling  assumptions. 


4.2  Revisiting  the  underlying  assumptions 

For  the  interconnect  delay  to  total  delay  ratio  to  fall,  wire 
capacitances  must  scale  down  faster  than  gate  and  diffusion 
capacitances.  However,  how  wires,  gates,  and  diffusions  will 
actually  scale  is  unclear.  Sylvester  and  Keutzer  scaled  wire 
capacitance  per  unit  length  by  0.85  per  generation  for  Iow-k 
dielectrics,  which  is  slower  than  the  SIA  roadmap  but  more 
optimistic  than  other  projections  [5].  They  scaled  gate  capaci¬ 
tances  down  faster  than  S,  since  they  projected  that  Tox  would 
decrease  only  slowly.  Yet  the  use  of  high-K  oxide  dielectrics 
would  allow  aggressive  scaling  of  an  effective  Tox  [8][9][10]. 
Finally,  although  diffusion  capacitances  were  scaled  down 
slower  than  S,  shallow  trench  isolations  (STIs)  or  simply 
legged  devices  can  cause  Cdig to  scale  faster  than  S  [1 1].  Sili¬ 
con-on-insulator  (SOI)  devices  could  also  reduce  the  impact  of 
diffusion  capacitance  drastically. 

Some  scaling  scenarios  (e.g.,  effective  1ow-k  dielectrics, 
aggressive  Tox  scaling,  and  no  diffusion  capacitance  reduction) 
imply  that  wire  capacitance  falls  faster  than  gate  and  diffusion 
capacitance  and  that  the  average  interconnect  to  total  delay 
ratio  will  decrease  with  scaling.  However,  other  scenarios  ( e.g ., 
interconnect  1ow-k  scaling  at  0.625  -  just  5%  slower  than  pre¬ 
viously  assumed,  Tox  scaling  as  previously  assumed,  but  STIs 


1 .  The  Rdrv  term  includes  a  factor  (close  to  Ini)  that  accounts  for  the 
fact  that  we  are  interested  in  the  delay  to  the  switching  point  of  the 
next  gate,  which  is  taken  to  be  Vdd/2. 
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and  legged  devices  causing  diffusion  capacitance  to  scale 
faster  at  S2*S'0,5  =  0.61)  imply  that  wire  capacitance  actually 
grows  compared  to  gate  and  diffusion  capacitance.  In  such 
cases,  the  interconnect  to  total  delay  ratio  will  increase  with 
scaling.  Thus  the  analysis  is  sensitive  to  the  scaling  assump¬ 
tions  used  and  can  lead  to  opposite  conclusions  given  reason¬ 
able  but  slightly  different  scaling  assumptions. 

The  critical  point  to  notice  is  that  in  all  these  scenarios, 
regardless  of  what  scaling  assumptions  are  used,  the  average 
interconnect  to  total  delay  ratio  does  not  change  that  much,  up 
or  down,  between  successive  technology  generations.  This 
result  confirms  Sylvester  and  Keutzer  conclusion:  current 
CAD  tools  will  be  able  to  handle  average  wires  in  future  tech¬ 
nologies.  However,  restricting  the  analysis  to  these  short  aver¬ 
age  wires  can  be  optimistic,  since  timing  problems  can  arise 
due  to  long  wires  and  their  appreciable  resistances. 

4.3  Average  wire  length  determination 

Sylvester  and  Keutzer  derived  average  wire  lengths  using 
empirically  fitted  Rent  parameters  and  Donath  statistics  [12]. 
One  problem  with  using  average  wire  lengths  is  that  they  are  a 
very  weak  function  of  the  total  gate  count.  Using  the  Donath 
formulation  for  average  wire  length  [12],  Figure  2  shows  aver¬ 
age  wire  length  as  a  function  of  gate  count  for  various  values  of 
Rent’s  exponent.  Sylvester  and  Keutzer  examined  a  number  of 
designs  and  determined  that  the  average  Rent’s  exponent  was 
approximately  p=0.7.  The  corresponding  curve  below  shows 
that  as  gate  count  grows  from  5K  to  50K,  a  ten-fold  increase, 
average  wire  length  only  increases  by  a  factor  of  1 .66. 


Fig.  2.  Average  wire  length  versus  gate  count 

This  weak  relationship  between  average  wire  length  and 
gate  count  arises  since  computation  and  communication  in 
VLSI  designs  tends  to  be  spatially  local.  Designers  often  con¬ 
struct  large  gate  count  blocks  by  accreting  several  smaller 
blocks  together,  thus  keeping  the  average  wire  length  fairly 
short.  However,  synthesis  convergence  is  most  vulnerable 
when  unpredicted  wire  loads  and  delays  add  to  critical  paths, 
which  occurs  with  long  wires,  not  with  average  length  ones. 

5.0  Long  wires 

A  “long”  wire  can  be  defined  simply  as  one  whose  intrinsic 
delay  is  some  fraction  of  a  gate  delay  in  a  given  technology. 
The  gate  delay  we  use  is  a  fanout-of-four  inverter  (F04)  delay. 
A  wire  is  considered  long  if  its  length  exceeds  Lcrit: 


<EQ4) 

Rw  and  Cw  are  per-unit  length  wire  resistance  and  capaci¬ 
tance.  In  this  paper,  we  will  choose  the  fraction  a  as  half  of  a 
F04  delay,  although  the  exact  value  is  not  central  to  the  analy¬ 
sis.  For  a  typical  0.25pm  process,  a  F04  delay  is  about  90pS, 
and  a  minimum  width  copper  M1/M2  wire  has  a  capacitance  of 
about  0.15fF/|im  and  a  resistance  of  about  150mQ/^im.  Thus, 
the  Lcrit  for  this  technology  is  about  1.3mm,  which  is  certainly 
longer  than  the  average  length  wire,  but  still  a  realizable  length 
in  modestly  sized  modules.  For  example,  even  with  an  optimis¬ 
tic  area  utilization  of  100%  and  a  cell  pitch  of  10pm,  a  50K 
gate  block  still  has  a  semi-perimeter  of  more  than  4mm.  In 
Section  5.1,  we  will  derive  estimates  of  the  number  of  wires 
that  exceed  Lcrit  for  a  given  design  size  by  examining  the  wire 
length  distribution. 

Long  wires  present  two  problems  for  today’s  CAD  method¬ 
ologies.  First,  long  wires  have  large  parasitic  capacitances,  so 
that  driving  gates  can  be  undersized.  Simple  upsizing  will 
match  the  driver  to  the  wire,  but  at  a  cost  of  ramping  up  in  pre¬ 
vious  gates  and  across  other  fanout  paths,  leading  to  timing 
divergence  as  described  in  Section  3.0.  Second,  no  matter  how 
large  we  size  the  driver,  the  intrinsic  wire  RC  delay,  unpre¬ 
dicted  during  synthesis,  remains  constant.  Various  techniques, 
such  as  widening  wires  or  relayout,  mitigate  the  effects  of  long 
wires  but  are  not  part  of  today’s  standard  CAD  flow. 

Under  scaling,  the  DF04  scales  by  S,  Rw  scales  by  approxi¬ 
mately  1/S2  (assuming  nearly  constant  aspect  ratio),  and  Cw 
scales  by  between  1  and  S  (depending  on  the  use  of  1ow-k 
dielectrics).  Thus,  Lcrit  scales  by  between  S  and  S1'5,  meaning 
that  with  scaling,  more  and  more  wires  in  a  constant  complex¬ 
ity  design  will  exceed  Lcrit.  The  scaling  of  Lcrit  in  gate  pitches 
is  shown  below  for  both  SIA  and  Sylvester/Keutzer  projec¬ 
tions,  assuming  that  gate  pitches  scale  with  process  (starting  at 
10pm  in  the  0.25^m  technology).  Note  that  Lcrit  scales  down 
slowly,  only  changing  by  1.6  between  5  technology  genera¬ 
tions. 
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Fig.  3.  Lcrit  under  scaling 

Using  different  scaling  factors  can  lead  to  different  slopes 
of  the  above  curve,  but  all  reasonable  scaling  assumptions  lead 
to  some  decrease  in  Lcrit  with  scaling.  Thus,  the  number  of 
wires  that  exceed  Lcrit  will  increase  with  scaling.  To  give  us  an 
idea  of  the  number  of  nets  that  will  require  manual  intervention 
by  the  designer(s),  the  next  section  examines  the  tail  of  the 


wire  length  distribution  to  determine  the  percentage  of  wires 
exceeding  Lcrit  (“long”  wires)  in  a  design  of  a  given  gate  count. 

5.1  Wire  length  distributions 

Davis  et  al.  constructed  a  model  for  wire  length  distribu¬ 
tions,  given  Rent  parameters  and  gate  count  [13],  and  showed 
it  was  a  good  fit  for  many  designs.  We  applied  their  model  to 
four  specifically  synthesized  blocks:  three  units  from  the  M- 
Machine,  a  fine-grained  multicomputer  designed  at  MIT  and 
Stanford  [14],  and  the  global  placement  of  Magic,  a  synthe¬ 
sized  controller  chip  from  Stanford’s  Flash  multiprocessor  [15] 
(minus  the  artificially  long  hand-routed  MiscBus).  From  Fig¬ 
ures  4  and  5  (we  only  show  one  M-Machine  plot  for  brevity), 
we  see  that  the  model  is  a  good  fit  for  the  wire  length  distribu¬ 
tions  of  these  designs,  which  span  a  wide  range  of  gate  count. 
The  outliers  in  the  M-Machine  data  are  from  long  buses. 
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Fig.  4.  M-Machine  IUCtrl  unit 
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Fig.  5.  Flash  MAGIC  node  controller 

The  percentage  of  wires  that  exceed  Lcrit  for  a  variety  of 
gate  design  sizes  and  process  technologies  is  shown  below. 


From  the  above  analysis  we  can  conclude  that  we  are  not 
heading  for  a  technology  node  in  which  there  will  be  a  cata¬ 
strophic  failure  in  module-level  CAD  tools  such  that  the  timing 
of  most  nodes  in  the  system  will  deviate  significantly  from  the 
pre-synthesis  estimate.  However,  simply  because  module-level 
tools  will  not  perform  significantly  worse  in  the  future  does  not 
mean  that  there  will  be  no  need  for  improvement.  Specifically 
the  exponentially  increasing  design  complexity  will  force  a 
need  for  an  improvement  in  the  module  level  tools. 

Scaling  has  two  effects  on  the  design  -  it  increases  the  per¬ 
centage  of  wires  that  exceed  Lcrit,  and  it  increases  the  total 
number  of  wires  in  the  design.  Scaling  from  0.25pm  to  0.05pm 
doubles  the  percentage  of  wires  greater  than  Lcrit  in  each  50K 
block,  but  also  increases  the  number  of  blocks  by  25.  To  keep 
the  design  time  reasonably  constant,  the  percentage  of  excep¬ 
tions  must  fall  by  at  least  25.  Given  these  numbers,  arguing 
about  exactly  how  the  number  of  wires  that  exceed  Lcrit  scales 
for  a  fixed  sized  block  misses  the  real  problem.  The  exponen¬ 
tially  increasing  design  complexity  causes  an  exponential 
growth  in  the  number  of  blocks.  So  even  if  the  number  of 
exceptional  wires  in  a  block  is  constant  under  scaling,  the 
number  of  exceptional  wires  in  the  total  design  is  growing  rap¬ 
idly.  The  tools  must  be  much  better  at  solving  wire  problems  to 
keep  designer  productivity  growing. 

5.2  CAD  Implications 

As  the  design  is  scaled  to  the  next  technology  generation, 
the  capacity  of  a  same-sized  die  doubles,  and  thus  the  com¬ 
plexity  and  gate  count  of  the  design  has  grown.  If  the  designer 
now  owns  twice  as  many  of  these  50K  gate  blocks,  then  either 
she  has  to  manually  intervene  for  twice  as  many  wires,  or  the 
CAD  flow  must  now  be  able  to  handle  wires  longer  than  Lcriv 
The  cumulative  wire  distribution  function  from  Davis  [13] 
gives  the  longest  wire  (in  gate  pitches)  as  the  percentage  of 
wires  varies,  as  shown  below.  Here,  area  utilization  is  assumed 
to  be  60%. 
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Fig.  7.  Wire  distribution  with  percentage  of  wires 

The  slope  of  this  function  tells  us  what  extra  wire  length 
the  CAD  flow  must  handle  to  reduce  the  designer’s  workload 
appropriately.  For  the  example  in  Figure  7,  with  a  50K  gate 
design  for  which  0.8%  of  the  wires  are  manually  handled,  the 
additional  wire  length  the  CAD  flow  must  handle  for  a  0.4% 
gain  in  percentage  of  wires  is  about  32  gate  pitches.  In  a 
0.25pm  technology  with  a  10pm  gate  pitch  and  a  60%  area  uti- 


Fig.  6.  Percentage  of  wires  that  exceed  Lcrit 


lization,  this  additional  length  is  around  510jim  (added  onto 
the  Lcrit  of  1.3mm),  an  increase  of  nearly  40%  in  length  and  a 
factor  of  2  in  wire  delay. 

From  these  results  we  can  see  that  as  processes  scale  and 
designs  grow  in  complexity,  designer  workloads  for  manually 
fixing  long  nets  will  increase.  If  the  CAD  tools  do  not  improve, 
the  design  time  and  cost  will  scale  up  rapidly.  If  the  tools  do 
improve  to  handle  even  a  slightly  larger  percentage  of  nets,  the 
wire  length  that  they  must  handle  grows  rapidly,  and  the  cur¬ 
rently  ignored  wire  R  will  have  to  be  taken  into  account. 

This  need  for  fewer  exceptional  wires  is  the  core  reason 
that  researchers  are  exploring  new  methodologies  that  use  vari¬ 
able  width  routing,  metal  promotion,  synthesis-driven  layout 
techniques  [16],  layout-driven  synthesis  techniques  [3],  post¬ 
layout  resynthesis  optimizations  [17]  [18],  and  combining  lay¬ 
out  and  synthesis  [19]. 

6.0  Conclusions 

From  the  above  analysis,  we  can  see  that  the  performance 
of  today’s  CAD  tools  on  module-level  blocks  will  not  be  sig¬ 
nificantly  worse  in  the  future.  However,  as  technology  scales, 
this  level  of  performance  will  no  longer  be  good  enough. 
Assuming  that  the  design  team  size  remains  constant,  the  num¬ 
ber  of  modules  that  an  individual  designer  is  responsible  for 
grows  exponentially.  Given  that  the  number  of  CAD  tool 
exceptions  that  a  designer  can  manually  handle  is  fixed,  the 
number  of  exceptions  per  module  must  decrease  exponentially. 
Thus,  the  CAD  tools  will  have  to  improve  to  handle  longer  and 
longer  wires,  in  order  to  decrease  the  percentage  of  exceptions, 
so  that  the  number  of  exceptions  per  designer  remains  con¬ 
stant.  Fundamentally,  it  is  not  that  the  CAD  tools  will  perform 
much  worse  in  the  future,  but  rather  that  the  pressure  of  ever 
improving  process  technology  will  require  them  to  perform 
much  better. 
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